# Long text processing
Minicpm4 0.5B
Apache-2.0
MiniCPM4 is an efficient large - language model designed specifically for edge devices. Through systematic innovation, it achieves extreme efficiency improvements in four key dimensions: model architecture, training data, training algorithm, and inference system.
Large Language Model
Transformers Supports Multiple Languages

M
openbmb
415
20
Minicpm4 8B
Apache-2.0
MiniCPM4 is an efficient large language model designed specifically for edge devices. Through systematic innovation, it achieves extreme efficiency improvements in four dimensions: model architecture, training data, training algorithm, and inference system. It can achieve over 5 times faster generation speed on edge chips.
Large Language Model
Transformers Supports Multiple Languages

M
openbmb
643
103
Qwen3 4B GGUF
Apache-2.0
Qwen3-4B is the latest version in the Qwen series of large language models with 4B parameters, supporting switching between reasoning and non-reasoning modes, excelling at inference, instruction following, and multilingual processing.
Large Language Model
Q
QuantFactory
341
1
Qwen3 8B AWQ
Apache-2.0
Qwen3-8B-AWQ is the latest generation of large language model with 8.2B parameters in the Tongyi Qianwen series, which uses AWQ 4-bit quantization technology to optimize inference efficiency. It supports the switching between thinking and non-thinking modes and has excellent reasoning, instruction-following, and intelligent agent capabilities.
Large Language Model
Transformers

Q
Qwen
13.99k
2
Qwen3 8B GPTQ Int4
Apache-2.0
Qwen3-4B is the latest large language model in the Qwen series, featuring the ability to switch thinking modes, powerful reasoning capabilities, excellent human preference alignment, outstanding agent capabilities, and multilingual support.
Large Language Model
Transformers

Q
JunHowie
2,365
2
Gemma 3 R1984 27B Q6 K GGUF
GGUF format model converted from VIDraft/Gemma-3-R1984-27B, supporting multilingual text generation
Large Language Model Supports Multiple Languages
G
GrimsenClory
28
1
Reranker ModernBERT Large Gooaq Bce
Apache-2.0
This is a cross-encoder model fine-tuned from ModernBERT-large, used to calculate the scores of text pairs, suitable for text re-ranking and semantic search tasks.
Text Embedding English
R
tomaarsen
596
5
Croguana RC2 Gguf
Croatian text generation model based on Mistral architecture, trained with Unsloth acceleration
Large Language Model Other
C
Shome
55
1
Qwen2.5 QwQ 35B Eureka Cubed
Enhanced version of QwQ-32B, suitable for all usage scenarios, with outstanding reasoning and output capabilities.
Large Language Model
Transformers Other

Q
DavidAU
591
9
Thor V2.5 8b FANTASY FICTION 128K Q4 K M GGUF
This is a GGUF-format converted 8B-parameter language model specialized for fantasy fiction, supporting 128K context length.
Large Language Model English
T
MrRobotoAI
22
0
Frigg V2 8b ACADEMIC 128K Q4 K M GGUF
Frigg-v2-8b-ACADEMIC-128K-Q4_K_M-GGUF is an 8B-parameter large language model in GGUF format, suitable for various text generation tasks.
Large Language Model English
F
MrRobotoAI
18
0
Longwriter V 72B
Other
A multimodal large model fine-tuned on the LongWriter-V-22K dataset based on Qwen2.5-VL-72B-Instruct
Text-to-Image
Transformers

L
THU-KEG
15
2
L3.3 Cu Mai R1 70b
A 70B-parameter large language model based on the Llama3 architecture, specially optimized
Large Language Model
L
Steelskull
164
14
Italian ModernBERT Base
Apache-2.0
Italian ModernBERT is a specialized version of ModernBERT for Italian language, pre-trained specifically on Italian text.
Large Language Model
Transformers Other

I
DeepMount00
119
2
Modernbert Base Ita
Apache-2.0
ModernBERT is a modern bidirectional encoder-only Transformer model (BERT-style), pre-trained on 2 trillion tokens of English and code data, with a native context length of up to 8,192 tokens.
Large Language Model
Transformers Supports Multiple Languages

M
DeepMount00
81
10
KURE V1
MIT
KURE-v1 is an embedding model specifically optimized for Korean text retrieval, fine-tuned based on BAAI/bge-m3, and excels in Korean retrieval tasks.
Text Embedding
K
nlpai-lab
27.44k
37
Gte Base Ko
A sentence embedding model fine-tuned on the Korean triplet dataset based on the Alibaba-NLP/gte-multilingual-base model for semantic similarity calculation
Text Embedding Supports Multiple Languages
G
juyoungml
18
2
Gte Base Ko
This is a sentence-transformers model fine-tuned on a Korean triplet dataset based on Alibaba NLP/gte-multilingual-base, designed for semantic textual similarity tasks.
Text Embedding Supports Multiple Languages
G
scottsuk0306
18
2
Granite 3.0 3b A800m Instruct
Apache-2.0
A 3-billion parameter instruction-tuned language model developed by IBM, based on Granite-3.0 architecture, supporting multilingual tasks and commercial applications
Large Language Model
Transformers

G
ibm-granite
5,240
18
Magnum V2 72b
Other
This model is a large language model fine-tuned based on Qwen-2 72B Instruct, aiming to replicate the prose quality of the Claude 3 series of models and is the seventh version in the series of models.
Large Language Model
Safetensors Supports Multiple Languages
M
anthracite-org
302
39
Gte Base Korean
Apache-2.0
A Korean sentence embedding model fine - tuned on Alibaba - NLP/gte - multilingual - base, supporting tasks such as semantic text similarity calculation and semantic search.
Text Embedding
G
upskyy
1,436
4
Jais Family 1p3b Chat
Apache-2.0
Jais series 1.3 billion parameter Arabic-English bilingual large language model, optimized for exceptional Arabic capabilities while maintaining strong English proficiency
Large Language Model Supports Multiple Languages
J
inceptionai
479
6
Bert 1.3b
Apache-2.0
Transformer encoder pretrained based on Megatron-LM, specifically designed for Japanese scenarios
Large Language Model
Transformers Supports Multiple Languages

B
retrieva-jp
56
15
Llama 2 7b Ukrainian
Llama-2-7b-Ukrainian Version is a bilingual pre-trained model supporting Ukrainian and English, based on continued pre-training of Llama-2-7b using 5 billion tokens of data from CulturaX.
Large Language Model
Transformers Supports Multiple Languages

L
tartuNLP
141
2
Turkish Llama 8b V0.1 GGUF
Turkish-Llama-8b-v0.1 is a fully fine-tuned Turkish text generation model based on LLaMA-3 8B, trained on a 30GB Turkish dataset.
Large Language Model Other
T
LiteLLMs
108
2
Yi 1.5 34B Chat 16K
Apache-2.0
Yi-1.5 is an upgraded version of the Yi model, demonstrating superior performance in programming, mathematics, reasoning, and instruction-following capabilities.
Large Language Model
Transformers

Y
01-ai
807
27
Yi 1.5 9B Chat
Apache-2.0
Yi-1.5 is an upgraded version of the Yi model, excelling in programming, mathematics, reasoning, and instruction-following capabilities while maintaining outstanding language understanding, commonsense reasoning, and reading comprehension.
Large Language Model
Transformers

Y
01-ai
17.16k
143
Mlong T5 Tglobal Base Et Riigikogu Summary
Apache-2.0
This is an Estonian text summarization model based on the T5 architecture, specifically designed for summarizing stenographic records of the Estonian Parliament discussions.
Text Generation
Transformers Other

M
rristo
25
0
360zhinao 7B Base
Apache-2.0
360 Zhinao is an open-source large language model series developed by Qihoo 360, including base models and dialogue models with various context lengths, supporting both Chinese and English.
Large Language Model
Transformers Supports Multiple Languages

3
qihoo360
90
5
Mosaicml Mpt 7b Storywriter Bnb 4bit Smashed
PrunaAI's compressed MPT-7B story-writing model, enabling efficient inference through llm-int8 technology
Large Language Model
Transformers Other

M
PrunaAI
27
1
Bge M3 Zeroshot V2.0
MIT
A model specifically designed for efficient zero-shot classification, supporting multilingual text classification tasks without requiring training data
Text Classification
Transformers Other

B
MoritzLaurer
73.31k
49
Bge M3 Zeroshot V2.0 C
MIT
A multilingual zero-shot text classification model trained based on BAAI/bge-m3-retromae, specifically designed for commercial-friendly scenarios
Text Classification
Transformers Other

B
MoritzLaurer
67
13
Rubert Mini Sts
MIT
This is a base BERT model for computing compact embedding vectors of Russian sentences, developed based on cointegrated/rubert-tiny2, with the number of layers increased from 3 to 7.
Text Embedding
Transformers Other

R
sergeyzh
2,351
4
Qra 1b
Apache-2.0
Qra is a series of Polish-optimized large language models jointly developed by the Polish National Information Processing Institute and Gdańsk University of Technology, initialized based on TinyLlama-1.1B and trained on 90 billion Polish tokens
Large Language Model
Transformers

Q
OPI-PG
246
20
Ruropebert Classic Base 512
A Russian encoder model based on the RoPEBert architecture, trained using cloning methods, supports 512-token context, and surpasses the original ruBert-base model in quality
Large Language Model
Transformers Other

R
Tochka-AI
103
1
Polka 1.1b
Apache-2.0
polka-1.1b is a bilingual (Polish and English) text generation model enhanced by continuing pre-training on 5.7 billion Polish tokens based on the TinyLlama-1.1B model.
Large Language Model
Transformers Supports Multiple Languages

P
eryk-mazus
174
8
Bagel 34b V0.2
Apache-2.0
An experimental fine-tuned model based on yi-34b-200k, suitable for creative writing, role-playing, and other tasks, without DPO stage applied.
Large Language Model
Transformers

B
jondurbin
265
41
Law LLM 13B GGUF
Other
Law LLM 13B is a specific domain foundation model developed based on LLaMA-1-13B, focusing on tasks in the legal domain.
Large Language Model
Transformers English

L
TheBloke
420
8
Bce Reranker Base V1
Apache-2.0
A bilingual and cross-language reranking model optimized for RAG, supporting Chinese, English, Japanese, and Korean, providing explainable absolute scores
Text Embedding
Transformers Supports Multiple Languages

B
maidalun1020
68.29k
190
Titulm Mpt 1b V1.0
Apache-2.0
TituLM-1B-BN-V1 is a large language model specifically trained for generating and understanding Bengali text, extensively trained on a dataset containing 4.51 billion Bengali tokens.
Large Language Model
Transformers Other

T
hishab
61
11
- 1
- 2
Featured Recommended AI Models